Abstract

The Han-Kobayashi achievable region is the best known inner bound for a general discrete memoryless interference
channel. We show that the capacity region can be strictly larger than the Han-Kobayashi region for some channel
realizations, which establishes the strict sub-optimality of the Han-Kobayashi achievable region.


1 Introduction 


The interference channel models the communication of two (or more) transmitter/receiver pairs over a shared
medium. A computable characterization of the capacity region ($\mathcal{C}$) of interference channels is a classical and
fundamental open problem in multi-terminal information theory. Given the vast interest in wireless communications
and the prominent presence of interference in such settings, characterizing (or at least understanding)
the capacity region of the interference channel is of central importance.

The interference channel shown below models the communication of two transmitter/receiver pairs and 
is the primary model we use for this study. 


Fig. 1: Discrete memoryless interference channel

We follow the standard definitions of achievable rates and capacity region, as well as the standard 
notations that may be found, for instance, in [1]. The best known achievable rate region under such definitions 
is due to Han and Kobayashi [2] and an equivalent form [1] is presented below. 

Theorem 1 (Han-Kobayashi (HK) inner bound). A rate pair $(R_1, R_2)$ is achievable for the channel described
in Figure 1 if

$$R_1 < I(X_1; Y_1 \mid U_2, Q), \tag{1}$$

$$R_2 < I(X_2; Y_2 \mid U_1, Q), \tag{2}$$

$$R_1 + R_2 < I(X_1, U_2; Y_1 \mid Q) + I(X_2; Y_2 \mid U_1, U_2, Q), \tag{3}$$

$$R_1 + R_2 < I(X_2, U_1; Y_2 \mid Q) + I(X_1; Y_1 \mid U_1, U_2, Q), \tag{4}$$

$$R_1 + R_2 < I(X_1, U_2; Y_1 \mid U_1, Q) + I(X_2, U_1; Y_2 \mid U_2, Q), \tag{5}$$

$$2R_1 + R_2 < I(X_1, U_2; Y_1 \mid Q) + I(X_1; Y_1 \mid U_1, U_2, Q) + I(X_2, U_1; Y_2 \mid U_2, Q), \tag{6}$$

$$R_1 + 2R_2 < I(X_2, U_1; Y_2 \mid Q) + I(X_2; Y_2 \mid U_1, U_2, Q) + I(X_1, U_2; Y_1 \mid U_1, Q) \tag{7}$$

for some pmf $p(q)\,p(u_1, x_1 \mid q)\,p(u_2, x_2 \mid q)$, where $|\mathcal{U}_1| \le |\mathcal{X}_1| + 4$, $|\mathcal{U}_2| \le |\mathcal{X}_2| + 4$, and $|\mathcal{Q}| \le 7$. The set of
achievable rate pairs forms the Han-Kobayashi achievable region, or HK region, and is denoted by $\mathcal{R}_{HK}$.

The capacity region is known only for a small set of interference settings, such as strong interference
and injective deterministic interference. The sum capacity is established for a larger class of channels, such
as the Gaussian interference channel with mixed or very weak interference. In all the cases mentioned above,
the capacity region (or the sum capacity) matches the one given by $\mathcal{R}_{HK}$. However, it was not known
whether $\mathcal{R}_{HK}$ is the capacity region $\mathcal{C}$ in general. In this paper, we show that there are channel instances where
$\mathcal{R}_{HK} \subsetneq \mathcal{C}$, thus establishing the sub-optimality of the HK region.

The main ingenuity of our work lies in the choice of the channel realizations, because the computation
of the HK region is not particularly straightforward. We study a class of interference channels, defined
as CZI channels in the next section, for which the evaluation of $\mathcal{R}_{HK}$ is significantly simplified¹. We
take particular channels inside this class and compute a (normalized) two-letter achievable region of the
corresponding two-letter product channel. We show that there are many examples where the (normalized)
two-letter achievable region considered is strictly larger than $\mathcal{R}_{HK}$, which implies $\mathcal{R}_{HK} \subsetneq \mathcal{C}$.

2 CZI Channel 

We say that an interference channel has clean Z interference (CZI) if one of the subchannels is a clean
channel. We choose the channel from $X_2$ to $Y_2$ to be clean, as depicted in Figure 2, and study its HK region.



Fig. 2: Discrete memoryless CZI channel

The following proposition reveals an equivalent characterization of the HK region for CZI channels which 
simplifies its evaluation. 

Proposition 1. The HK region of a CZI channel is identical to the set of rate pairs $(R_1, R_2)$ that satisfy

$$R_1 < I(X_1; Y_1 \mid U_2, Q), \tag{8}$$

$$R_2 < H(X_2 \mid Q), \tag{9}$$

$$R_1 + R_2 < I(X_1, U_2; Y_1 \mid Q) + H(X_2 \mid U_2, Q) \tag{10}$$

for some pmf $p(q)\,p(u_2 \mid q)\,p(x_2 \mid u_2)\,p(x_1 \mid q)$, where $|\mathcal{U}_2| \le |\mathcal{X}_2|$ and $|\mathcal{Q}| \le 2$.

Proof. First, it is a simple exercise to check that the HK region of a CZI channel reduces to the three
constraints above upon setting $U_1 = \emptyset$. Hence, the above region is a subset of the HK region.

Conversely, (8) is identical to (1) of the HK region, while (9) and (10) are looser constraints than
(2) and (3) of the HK region, respectively. Hence the above region contains the original HK region, and the two
regions are equal.

Note that the changes in the cardinalities of $U_2$ and $Q$ follow from standard applications of cardinality reduction
techniques, which leave the underlying region unchanged. Therefore, we do not have to take these changes
into account when establishing the equivalence of the two regions. □

¹ An earlier attempt was made by some of the authors [3] along very similar lines, where the sum-capacity of very weak
interference channels was studied. However, they were unable to identify examples where the (normalized) two-letter achievable
sum-rate of a two-letter product channel becomes larger than that of the original $\mathcal{R}_{HK}$. Note that the CZI channels considered here are a
further subclass of very weak interference channels; on the other hand, we study a weighted sum-rate rather than the sum-rate.
The first result that we present below shows the optimality of the HK region along certain
directions.

Proposition 2. For a CZI channel,

$$\max_{\mathcal{R}_{HK}} (\lambda R_1 + R_2) = \max_{\mathcal{C}} (\lambda R_1 + R_2), \quad \forall \lambda \le 1.$$

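Proof. A standard converse (outer bound) argument proves that treating interference as noise is optimal.
Below, $X_1^{n \setminus i}$ denotes $(X_1^{i-1}, X_{1,i+1}^{n})$.

$$
\begin{aligned}
n(\lambda R_1 + R_2) - n\epsilon
&\overset{(a)}{\le} H(X_2^n \mid X_1^n) + \lambda I(X_1^n; Y_1^n) \\
&= \sum_{i=1}^{n} \Big[ H(X_{2i} \mid X_{1i}) - I(X_{2i}; X_2^{i-1}, X_1^{n \setminus i} \mid X_{1i}) + \lambda I(X_1^n; Y_{1i} \mid Y_{1,i+1}^n) \Big] \\
&\le \sum_{i=1}^{n} \Big[ H(X_{2i} \mid X_{1i}) - I(X_{2i}; X_2^{i-1}, X_1^{n \setminus i} \mid X_{1i}) + \lambda I(X_1^n, Y_{1,i+1}^n; Y_{1i}) \Big] \\
&= \sum_{i=1}^{n} \Big[ H(X_{2i} \mid X_{1i}) - I(X_{2i}; X_2^{i-1}, X_1^{n \setminus i} \mid X_{1i}) \Big] \\
&\quad + \lambda \sum_{i=1}^{n} \Big( I(X_1^n, Y_{1,i+1}^n, X_2^{i-1}; Y_{1i}) - I(X_2^{i-1}; Y_{1i} \mid X_1^n, Y_{1,i+1}^n) \Big) \\
&\overset{(b)}{=} \sum_{i=1}^{n} \Big[ H(X_{2i} \mid X_{1i}) - I(X_{2i}; X_2^{i-1}, X_1^{n \setminus i} \mid X_{1i}) \Big] \\
&\quad + \lambda \sum_{i=1}^{n} \Big( I(X_1^n, Y_{1,i+1}^n, X_2^{i-1}; Y_{1i}) - I(Y_{1,i+1}^n; X_{2i} \mid X_1^n, X_2^{i-1}) \Big) \\
&= \sum_{i=1}^{n} \Big[ H(X_{2i} \mid X_{1i}) - (1 - \lambda) I(X_{2i}; X_2^{i-1}, X_1^{n \setminus i} \mid X_{1i}) \Big] \\
&\quad - \lambda \sum_{i=1}^{n} \Big( I(X_{2i}; X_2^{i-1}, X_1^{n \setminus i}, Y_{1,i+1}^n \mid X_{1i}) - I(X_1^n, Y_{1,i+1}^n, X_2^{i-1}; Y_{1i}) \Big) \\
&= \sum_{i=1}^{n} \Big[ H(X_{2i} \mid X_{1i}) - (1 - \lambda) I(X_{2i}; X_2^{i-1}, X_1^{n \setminus i} \mid X_{1i}) + \lambda I(X_{1i}; Y_{1i}) \Big] \\
&\quad - \lambda \sum_{i=1}^{n} \Big( I(X_{2i}; X_2^{i-1}, X_1^{n \setminus i}, Y_{1,i+1}^n \mid X_{1i}) - I(X_2^{i-1}, X_1^{n \setminus i}, Y_{1,i+1}^n; Y_{1i} \mid X_{1i}) \Big) \\
&\overset{(c)}{=} \sum_{i=1}^{n} \Big[ H(X_{2i} \mid X_{1i}) + \lambda I(X_{1i}; Y_{1i}) - (1 - \lambda) I(X_{2i}; X_2^{i-1}, X_1^{n \setminus i} \mid X_{1i}) \\
&\qquad\quad - \lambda I(X_{2i}; X_2^{i-1}, X_1^{n \setminus i}, Y_{1,i+1}^n \mid Y_{1i}, X_{1i}) \Big] \\
&\le n \max_{p_1(x_1) p_2(x_2)} \big( H(X_2) + \lambda I(X_1; Y_1) \big),
\end{aligned}
$$

where (a) follows from Fano's inequality, (b) from the Csiszár sum identity, and (c) from the Markov chain
$Y_{1i} - (X_{1i}, X_{2i}) - (X_1^{n \setminus i}, X_2^{i-1}, Y_{1,i+1}^n)$.

Since $\epsilon > 0$ is arbitrary, we see that any achievable rate pair must satisfy

$$\lambda R_1 + R_2 \le \max_{p_1(x_1) p_2(x_2)} H(X_2) + \lambda I(X_1; Y_1),$$

which is achievable by treating interference as noise, or more precisely, by setting $U_2 = \emptyset$ in the HK region.
Hence, the proposition is established. □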

On the contrary, we will see that, for some channels,

$$\max_{\mathcal{R}_{HK}} (\lambda R_1 + R_2) < \max_{\mathcal{C}} (\lambda R_1 + R_2)$$

when $\lambda$ becomes larger than 1. The following lemma helps us evaluate the quantity $\max_{\mathcal{R}_{HK}} (\lambda R_1 + R_2)$.
Lemma 1. For a CZI channel, for all $\lambda \ge 1$,

$$\max_{\mathcal{R}_{HK}} (\lambda R_1 + R_2) = \max_{p_1(x_1) p_2(x_2)} \Big\{ I(X_1, X_2; Y_1) + \mathfrak{C}_{p_2(x_2)} \big[ H(X_2) - I(X_2; Y_1 \mid X_1) + (\lambda - 1) I(X_1; Y_1) \big] \Big\}, \tag{11}$$

where $\mathfrak{C}_x[f(x)]$ denotes the upper concave envelope of $f(x)$ over $x$ [4].

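Proof. For any $(R_1, R_2) \in \mathcal{R}_{HK}$, there must exist a distribution $p(q)\,p_2(u_2, x_2 \mid q)\,p_1(x_1 \mid q)$ such that

$$
\begin{aligned}
\lambda R_1 + R_2 &\le (\lambda - 1) I(X_1; Y_1 \mid U_2, Q) + I(X_1, U_2; Y_1 \mid Q) + H(X_2 \mid U_2, Q) \\
&= I(X_1, X_2; Y_1 \mid Q) + H(X_2 \mid U_2, Q) - I(X_2; Y_1 \mid U_2, X_1, Q) + (\lambda - 1) I(X_1; Y_1 \mid U_2, Q) \\
&\overset{(d)}{\le} I(X_1, X_2; Y_1 \mid Q) + \mathfrak{C}_{p_2(x_2 \mid q)} \big[ H(X_2 \mid Q) - I(X_2; Y_1 \mid X_1, Q) + (\lambda - 1) I(X_1; Y_1 \mid Q) \big],
\end{aligned}
$$

where (d) follows directly from the definition of the upper concave envelope. Since $Q$ computes an average,
and since the average is at most the maximum, we obtain that

$$\max_{\mathcal{R}_{HK}} (\lambda R_1 + R_2) \le \max_{p_1(x_1) p_2(x_2)} \Big\{ I(X_1, X_2; Y_1) + \mathfrak{C}_{p_2(x_2)} \big[ H(X_2) - I(X_2; Y_1 \mid X_1) + (\lambda - 1) I(X_1; Y_1) \big] \Big\}.$$

On the other hand, for any $p_2(u_2, x_2)\,p_1(x_1)$, the rate pair

$$(R_1, R_2) = \big( I(X_1; Y_1 \mid U_2),\; H(X_2 \mid U_2) + I(U_2; Y_1) \big)$$

belongs to $\mathcal{R}_{HK}$, as it satisfies the constraints. Thus,

$$
\begin{aligned}
\max_{\mathcal{R}_{HK}} (\lambda R_1 + R_2)
&\ge \max_{p_2(u_2, x_2) p_1(x_1)} \big\{ \lambda I(X_1; Y_1 \mid U_2) + H(X_2 \mid U_2) + I(U_2; Y_1) \big\} \\
&= \max_{p_2(u_2, x_2) p_1(x_1)} \big\{ I(X_1, U_2; Y_1) + H(X_2 \mid U_2) + (\lambda - 1) I(X_1; Y_1 \mid U_2) \big\} \\
&= \max_{p_2(u_2, x_2) p_1(x_1)} \big\{ I(X_1, X_2; Y_1) + H(X_2 \mid U_2) - I(X_2; Y_1 \mid U_2, X_1) + (\lambda - 1) I(X_1; Y_1 \mid U_2) \big\} \\
&\overset{(e)}{=} \max_{p_2(x_2) p_1(x_1)} I(X_1, X_2; Y_1) + \mathfrak{C}_{p_2(x_2)} \big[ H(X_2) - I(X_2; Y_1 \mid X_1) + (\lambda - 1) I(X_1; Y_1) \big],
\end{aligned}
$$

where (e) also follows directly from the definition of the upper concave envelope; see [4]. This establishes
the reverse inequality and completes the proof of the lemma. □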
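The upper concave envelope in (11) is the only non-routine object to evaluate numerically. When $X_2$ is binary, the bracketed expression is a function of a single parameter, so one possible approach is to sample it on a grid and take the upper convex hull of the samples. The following is a minimal sketch under that assumption; the function name `upper_concave_envelope` and the grid-based method are illustrative and are not taken from the paper.

```python
def upper_concave_envelope(xs, ys):
    """Upper concave envelope of samples (xs[i], ys[i]), xs strictly increasing.

    Builds the upper convex hull with a monotone-chain scan and returns the
    envelope evaluated at the same xs (linear interpolation on hull edges).
    """
    hull = []  # indices of the samples that lie on the upper hull
    for i in range(len(xs)):
        # Drop the last hull point while it lies on or below the chord
        # joining its predecessor to the new sample.
        while len(hull) >= 2:
            a, b = hull[-2], hull[-1]
            cross = ((xs[b] - xs[a]) * (ys[i] - ys[a])
                     - (ys[b] - ys[a]) * (xs[i] - xs[a]))
            if cross >= 0:
                hull.pop()
            else:
                break
        hull.append(i)

    env = list(ys)
    for a, b in zip(hull, hull[1:]):
        for k in range(a, b + 1):
            t = (xs[k] - xs[a]) / (xs[b] - xs[a])
            env[k] = (1 - t) * ys[a] + t * ys[b]
    return env


# Example: the envelope of the convex function x^2 on [0, 1] is the chord y = x.
grid = [k / 100 for k in range(101)]
chord = upper_concave_envelope(grid, [x * x for x in grid])
```

The accuracy of such a grid evaluation is limited by the grid spacing, which is one reason an explicit characterization of the envelope is preferable whenever it is available.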

By viewing the channel use across two consecutive time-slots as the channel use of a single time-slot of 
the corresponding product channel, we obtain what is usually termed the two-letter realization of the original 
channel. For the two letter product channel of a CZI channel, the transition probability satisfies 


$$q(y_{11}, y_{12} \mid x_{11}, x_{12}, x_{21}, x_{22}) = q(y_{11} \mid x_{11}, x_{21})\, q(y_{12} \mid x_{12}, x_{22}),$$

where $q$ is the transition probability of the CZI channel.
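For binary alphabets, the product form above can be tabulated directly. The sketch below assumes the single-letter channel is stored as a nested list `W` with `W[x1][x2]` equal to $P(Y_1 = 0 \mid x_1, x_2)$; this storage layout and the function name `two_letter_channel` are hypothetical choices made for illustration.

```python
from itertools import product


def two_letter_channel(W):
    """Tabulate q(y11, y12 | x11, x12, x21, x22) for a binary CZI channel.

    W[x1][x2] = P(Y1 = 0 | x1, x2). The result maps the key
    ((x11, x12), (x21, x22)) to a dict over output pairs (y11, y12).
    """
    def q(y, x1, x2):
        return W[x1][x2] if y == 0 else 1.0 - W[x1][x2]

    channel = {}
    for x11, x12, x21, x22 in product((0, 1), repeat=4):
        channel[(x11, x12), (x21, x22)] = {
            (y11, y12): q(y11, x11, x21) * q(y12, x12, x22)
            for y11, y12 in product((0, 1), repeat=2)
        }
    return channel
```

Each of the 16 input combinations yields a probability vector over the four output pairs, so every inner dictionary sums to one.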

Proposition 3. The set of rate pairs satisfying

$$R_1 = \tfrac{1}{2} I(X_{11}, X_{12}; Y_{11}, Y_{12} \mid Q),$$

$$R_2 = \tfrac{1}{2} H(X_{21}, X_{22} \mid Q),$$

for some pmf $p(q)\,p(x_{11}, x_{12} \mid q)\,p(x_{21}, x_{22} \mid q)$ with $|\mathcal{Q}| \le 2$ is achievable for the original channel.

Proof. This rate pair is precisely the treating-interference-as-noise rate pair of the two-letter channel, and
the normalization by $\tfrac{1}{2}$ is due to the fact that we code over two time-slots of the original channel. □

We denote this (normalized) two-letter HK region as $\mathcal{R}_{two}$.
2.1 Sub-optimality of the HK region

In this part we provide several CZI channels for which, for some fixed $\lambda > 1$, $\max_{\mathcal{R}_{two}} (\lambda R_1 + R_2)$ becomes
larger than $\max_{\mathcal{R}_{HK}} (\lambda R_1 + R_2)$, which proves the sub-optimality of the HK region.

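The examples are channels with binary inputs and outputs. A 2 x 2 matrix is used to represent the channel:

$$q(y_1 \mid x_1, x_2) \;\leftrightarrow\;
\begin{bmatrix}
P(Y_1 = 0 \mid X_1 = 0, X_2 = 0) & P(Y_1 = 0 \mid X_1 = 0, X_2 = 1) \\
P(Y_1 = 0 \mid X_1 = 1, X_2 = 0) & P(Y_1 = 0 \mid X_1 = 1, X_2 = 1)
\end{bmatrix}.$$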
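With this representation, the treating-interference-as-noise pair $(I(X_1; Y_1), H(X_2))$ for product inputs can be evaluated directly. The following is a minimal sketch, assuming the row/column convention above (`W[x1][x2]` holds $P(Y_1 = 0 \mid x_1, x_2)$); the helper names are hypothetical, and the sample matrix is the $\lambda = 2.5$ entry of Table 1.

```python
from math import log2


def hb(x):
    """Binary entropy in bits, with hb(0) = hb(1) = 0."""
    if x <= 0.0 or x >= 1.0:
        return 0.0
    return -x * log2(x) - (1 - x) * log2(1 - x)


def tin_rates(W, p, q):
    """Return (R1, R2) = (I(X1;Y1), H(X2)) for P(X1=0) = p, P(X2=0) = q."""
    px1, px2 = (p, 1 - p), (q, 1 - q)
    # P(Y1 = 0 | x1): the interferer X2 is averaged out (treated as noise).
    y0_given_x1 = [sum(px2[b] * W[a][b] for b in (0, 1)) for a in (0, 1)]
    y0 = sum(px1[a] * y0_given_x1[a] for a in (0, 1))
    r1 = hb(y0) - sum(px1[a] * hb(y0_given_x1[a]) for a in (0, 1))
    return r1, hb(q)


def weighted_rate(W, p, q, lam):
    r1, r2 = tin_rates(W, p, q)
    return lam * r1 + r2


# Coarse grid search over product input distributions.
W_example = [[0.204581, 0.364813], [0.030209, 0.992978]]
best = max(weighted_rate(W_example, i / 100, j / 100, 2.5)
           for i in range(101) for j in range(101))
```

For $\lambda \le 1$, Proposition 2 says that a search of this kind (on a fine enough grid) approximates the optimal weighted sum-rate. For $\lambda > 1$ it only gives a lower bound on the HK maximum, since a non-trivial $U_2$ may help.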


The fact that $X_2$ is binary allows us to compute the upper concave envelope in Lemma 1 with extremely
high precision.

The channels in Table 1 are obtained using numerical methods. We prove, as a demonstration, in the
Appendix that the difference in rates for the first channel listed in Table 1 is not due to numerical errors and
that the maximum single-letter rate is indeed strictly smaller than the maximum (normalized) two-letter
rate achieved by the corresponding two-letter product channel.


2.2 Intuition and a natural modification 

In this section, we present an intuition as well as a coding strategy motivated by this intuition that indicates 
how one may improve on the Han-Kobayashi encoding scheme. 

The counterexamples we generated in the last section had the following feature: even though $\lambda$ was
strictly larger than one, the optimal $U_2$ that yielded $\max_{\mathcal{R}_{HK}} (\lambda R_1 + R_2)$ was still the trivial random variable,
implying that there were distributions $p_1(x_1)$ and $p_2(x_2)$ such that

$$R_1 = I(X_1; Y_1), \quad R_2 = I(X_2; Y_2) = H(X_2)$$

yielded the maximum weighted sum-rate.

Suppose we now go to the two-letter product channel and take the product distribution of the marginals
that yielded the one-letter maximum as the transmitter distribution; clearly, we would get the same rate. It is
an easy exercise to verify that $I(X_1; Y_1)$ is convex in the distribution of $X_2$ (utilizing the fact that $X_1$ and $X_2$ are independent).
Thus a perturbation of the product distribution into two distributions that preserve the average would
reduce $R_2 = \tfrac{1}{2} H(X_{21}, X_{22})$ but increase $R_1 = \tfrac{1}{2} I(X_{11}, X_{12}; Y_{11}, Y_{12})$. Since we are interested in $\lambda R_1 + R_2$
with $\lambda > 1$, it is conceivable that such a perturbation would increase the weighted sum-rate.

Note that $X_2$ acts like a state variable on the communication channel between $X_1$ and $Y_1$. If the
channel from $X_1 \to Y_1$, with $X_2$ as the state, is not memoryless, we know that the optimal code distributions
on $X_1^n$ are not independent distributions.

For instance, if one creates $X_2^n$ according to a first-order Markov process, the channel from $X_1^n$ to $Y_1^n$
becomes a channel whose state varies like a first-order Markov process. For such a coding strategy, one could
achieve $R_2 = \mathcal{H}(X_2)$, $R_1 = \mathcal{C}(X_1; Y_1)$, where $\mathcal{H}(X_2)$ denotes the entropy rate of the Markov process $X_2^n$
and $\mathcal{C}(X_1; Y_1)$ denotes the capacity of the channel whose state varies according to $X_2^n$.
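The $R_2$ side of this strategy is easy to evaluate when $X_2$ is binary: a stationary first-order Markov chain has entropy rate $\sum_s \pi_s H(X_{2,i+1} \mid X_{2,i} = s)$, where $\pi$ is the stationary distribution. The sketch below computes this quantity; the parametrization by the two flip probabilities `a` and `b` and the function name are assumptions made for illustration. The $R_1$ side is not attempted here.

```python
from math import log2


def hb(x):
    """Binary entropy in bits, with hb(0) = hb(1) = 0."""
    if x <= 0.0 or x >= 1.0:
        return 0.0
    return -x * log2(x) - (1 - x) * log2(1 - x)


def markov_entropy_rate(a, b):
    """Entropy rate (bits/symbol) of a stationary binary Markov chain.

    a = P(next = 1 | current = 0), b = P(next = 0 | current = 1), a + b > 0.
    """
    pi0 = b / (a + b)  # stationary probability of state 0
    pi1 = a / (a + b)  # stationary probability of state 1
    return pi0 * hb(a) + pi1 * hb(b)
```

When `a + b = 1` the chain is i.i.d. and the entropy rate reduces to the single-letter entropy, recovering the usual $R_2 = H(X_2)$.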

Note that in general $\mathcal{C}(X_1; Y_1)$ does not have a closed form and is quite hard to compute; but this scheme,
as opposed to block coding, appears to the authors to be a natural fit for interference channels. It would
also explain why i.i.d. coding (in the sense of Han-Kobayashi) might not be optimal for a CZI channel.

3 Conclusion 

We have shown in this paper that the Han-Kobayashi achievable region is strictly sub-optimal, which makes
finding new coding schemes and achievable regions for interference channels almost a necessity.


4 Appendix 

Analysis of a particular example 

Consider the CZI channel of Figure 3, which depicts $q(y_1 \mid x_1, x_2)$ as two point-to-point channels $X_1 \to Y_1$,
one for each choice of $X_2$. Our purpose is to show the details of the computation of $\mathcal{R}_{HK}$ when $\lambda = 2$.




Tab. 1: Table of counter-examples. Each channel is written as [row 1; row 2] of the 2 x 2 matrix defined in Section 2.1. The last two columns give the maximum of $\lambda R_1 + R_2$ over $\mathcal{R}_{HK}$ and over $\mathcal{R}_{two}$, respectively.

λ      channel                                       max over R_HK    max over R_two
2      (see Fig. 3)                                  1.107516         1.108141
2.5    [0.204581 0.364813; 0.030209 0.992978]        1.159383         1.169312
3      [0.591419 0.865901; 0.004021 0.898113]        1.241521         1.255814
3      [0.356166 0.073253; 0.985504 0.031707]        1.292172         1.311027
3      [0.287272 0.459966; 0.113711 0.995405]        1.117253         1.123151
4      [0.429804 0.147712; 0.948192 0.002848]        1.181392         1.196189
4      [0.068730 0.443630; 0.011377 0.954887]        1.223409         1.243958
5      [0.969199 0.564440; 0.954079 0.061409]        1.351229         1.372191
5      [0.943226 0.447252; 0.950791 0.024302]        1.231254         1.250564
6      [0.943292 0.045996; 0.589551 0.202487]        1.069405         1.076932
6      [0.714431 0.019375; 0.955918 0.448539]        1.528508         1.541781
7      [0.058449 0.558649; 0.194915 0.959172]        1.424974         1.452769
7      [0.033312 0.876067; 0.286125 0.992825]        1.179438         1.187867
10     [0.307723 0.874843; 0.032090 0.710535]        1.370830         1.388674
15     [0.946802 0.311909; 0.730770 0.155075]        1.391596         1.406325
100    [0.382410 0.081474; 0.584797 0.241840]        3.754016         3.789316
100    [0.673979 0.194596; 0.781192 0.285216]        1.711938         1.730715
Fig. 3: Binary CZI channel


Let $P(X_1 = 0) = p$ and $P(X_2 = 0) = q$. By Lemma 1,

$$\max_{\mathcal{R}_{HK}} (2R_1 + R_2) = \max_{p_1(x_1) p_2(x_2)} \Big\{ I(X_1, X_2; Y_1) + \mathfrak{C}_{p_2(x_2)} \big[ H(X_2) - I(X_2; Y_1 \mid X_1) + I(X_1; Y_1) \big] \Big\}. \tag{12}$$


Define

$$
\begin{aligned}
f(p, q) &:= H(X_2) - I(X_2; Y_1 \mid X_1) + I(X_1; Y_1) \\
&= h_b(q) - 2p\, h_b\!\left(\tfrac{1 + q}{2}\right) - 2(1 - p)\, h_b(q) + h_b\!\left(q + \tfrac{p}{2}(1 - q)\right) + p(1 - q).
\end{aligned}
\tag{13}
$$

Here $h_b(x) = -x \log_2(x) - (1 - x) \log_2(1 - x)$ denotes the binary entropy function. Thus, we obtain that

$$\max_{\mathcal{R}_{HK}} (2R_1 + R_2) = \max_{p, q} \Big\{ h_b\!\left(q + \tfrac{p}{2}(1 - q)\right) - p(1 - q) + \mathfrak{C}_q \big[ f(p, q) \big] \Big\}. \tag{14}$$

Clearly the main computational imprecision may² arise from the estimation of the concave envelope; however,
as the next result shows, for this channel we obtain an explicit characterization of the concave envelope.
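As a sanity check on the closed form (13), one can compare it with a direct evaluation of $H(X_2) - I(X_2; Y_1 \mid X_1) + I(X_1; Y_1)$. Since Figure 3 is not reproduced here, the channel matrix `W` below is an assumption: it is the matrix (rows indexed by $x_1$, columns by $x_2$, entries $P(Y_1 = 0 \mid x_1, x_2)$) that reproduces the terms of (13). All function names are hypothetical.

```python
from math import log2

# ASSUMPTION: channel matrix inferred from the closed form (13).
W = [[1.0, 0.5], [1.0, 0.0]]


def hb(x):
    """Binary entropy in bits, with hb(0) = hb(1) = 0."""
    if x <= 0.0 or x >= 1.0:
        return 0.0
    return -x * log2(x) - (1 - x) * log2(1 - x)


def f_closed(p, q):
    """Closed form (13)."""
    return (hb(q) - 2 * p * hb((1 + q) / 2) - 2 * (1 - p) * hb(q)
            + hb(q + p * (1 - q) / 2) + p * (1 - q))


def f_direct(p, q):
    """H(X2) - I(X2;Y1|X1) + I(X1;Y1), computed from W."""
    px1, px2 = (p, 1 - p), (q, 1 - q)

    def y0(x1):  # P(Y1 = 0 | x1), averaging over X2
        return sum(px2[b] * W[x1][b] for b in (0, 1))

    h_y_given_x1x2 = sum(px1[a] * px2[b] * hb(W[a][b])
                         for a in (0, 1) for b in (0, 1))
    h_y_given_x1 = sum(px1[a] * hb(y0(a)) for a in (0, 1))
    h_y = hb(sum(px1[a] * y0(a) for a in (0, 1)))
    i_x2_y_given_x1 = h_y_given_x1 - h_y_given_x1x2
    i_x1_y = h_y - h_y_given_x1
    return hb(q) - i_x2_y_given_x1 + i_x1_y
```

Under the stated assumption on `W`, the two functions agree up to floating-point error on the whole unit square.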

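Lemma 2. Consider the bivariate function $f(p, q)$ as defined in (13), where $(p, q) \in [0, 1] \times [0, 1]$. Then

(i) if $p \ge \tfrac{1}{2}$,

$$\mathfrak{C}_q[f(p, q)] = f(p, q);$$

(ii) if $p < \tfrac{1}{2}$,

$$\mathfrak{C}_q[f(p, q)] =
\begin{cases}
\dfrac{f(p, 1 - 2p) - f(p, 0)}{1 - 2p}\, q + f(p, 0), & q \in [0, 1 - 2p], \\[2mm]
f(p, q), & 1 - 2p < q \le 1.
\end{cases}$$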


Proof. The second derivative with respect to $q$ is

$$\frac{\partial^2 f(p, q)}{\partial q^2} = \frac{p \big(1 - 3q - 2p(1 - q)\big)}{q (1 - q) \ln 2 \, (1 + q) \big(2q + p(1 - q)\big)}. \tag{15}$$

If $p \ge \tfrac{1}{2}$, then (15) is negative for $q \in (0, 1)$, i.e., $f(p, q)$ is concave in $q$ and $\mathfrak{C}_q[f(p, q)] = f(p, q)$.

If $p \in (0, \tfrac{1}{2})$, then (15) has exactly one zero $q^* \in (0, 1)$, namely

$$q^* = \frac{1 - 2p}{3 - 2p}.$$

In fact, $f(p, q)$ is convex for $q \in (0, q^*)$ and concave for $q \in (q^*, 1)$. Thus $\mathfrak{C}_q[f(p, q)]$ consists of two parts.
The first part is the tangent line from the point $(0, f(p, 0))$ to the function $f(p, q)$, and the second part is equal to
$f(p, q)$.

² In general, since the concave envelope is computed over a single variable and the function is rather well behaved (at most
two inflection points) when $X_2$ is binary, numerical computations using Matlab have yielded very high precision results even
for the other counterexamples listed.
To find the point $q$ where the tangent line meets the function, we need to solve the following equation:

$$\frac{f(p, q) - f(p, 0)}{q} = \frac{\partial f(p, q)}{\partial q}.$$

Because the function is initially convex and then concave, the above equation has at most one solution
$q \ne 0$. One can verify that $q = 1 - 2p$ is the required solution, and this completes the proof. □
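The tangency claim in Lemma 2 can be spot-checked numerically: at $q = 1 - 2p$, the slope of the chord from $(0, f(p, 0))$ should equal $\partial f / \partial q$. The sketch below uses the closed form (13) and its derivative in $q$, obtained from $h_b'(x) = \log_2\frac{1 - x}{x}$; the function names are hypothetical and the check is an illustration, not part of the proof.

```python
from math import log2


def hb(x):
    """Binary entropy in bits, with hb(0) = hb(1) = 0."""
    if x <= 0.0 or x >= 1.0:
        return 0.0
    return -x * log2(x) - (1 - x) * log2(1 - x)


def f(p, q):
    """Closed form (13), with the two h_b(q) terms combined."""
    return ((2 * p - 1) * hb(q) - 2 * p * hb((1 + q) / 2)
            + hb(q + p * (1 - q) / 2) + p * (1 - q))


def df_dq(p, q):
    """Partial derivative of f in q, valid for 0 < q < 1."""
    u = q + p * (1 - q) / 2
    return ((2 * p - 1) * log2((1 - q) / q)
            - p * log2((1 - q) / (1 + q))
            + (1 - p / 2) * log2((1 - u) / u)
            - p)


def tangent_gap(p):
    """Chord slope from (0, f(p,0)) minus df/dq, both at q = 1 - 2p."""
    q = 1 - 2 * p
    return (f(p, q) - f(p, 0.0)) / q - df_dq(p, q)


def envelope(p, q):
    """Concave envelope of f(p, .) as characterized in Lemma 2."""
    t = 1 - 2 * p
    if q >= max(0.0, t):
        return f(p, q)
    return (f(p, t) - f(p, 0.0)) / t * q + f(p, 0.0)
```

For any $p \in (0, \tfrac{1}{2})$, `tangent_gap(p)` should vanish up to floating-point error, and `envelope(p, q)` should never fall below `f(p, q)`.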

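Define $F(p, q)$ for $(p, q) \in [0, 1] \times [0, 1]$ as

$$F(p, q) =
\begin{cases}
h_b\!\left(q + \tfrac{p}{2}(1 - q)\right) - p(1 - q) + f(p, q), & q \ge \max\{0, 1 - 2p\}, \\[2mm]
h_b\!\left(q + \tfrac{p}{2}(1 - q)\right) - p(1 - q) + \dfrac{f(p, 1 - 2p) - f(p, 0)}{1 - 2p}\, q + f(p, 0), & \text{otherwise},
\end{cases}
\tag{16}$$

where $f(p, q)$ is defined in (13).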

From Lemma 2 and (14), we know that

$$\max_{\mathcal{R}_{HK}} (2R_1 + R_2) = \max_{p, q} F(p, q). \tag{17}$$

A tedious exercise shows that the concave envelope of $F(p, q)$ w.r.t. $(p, q)$ matches the function value $F(p_0, q_0)$
at³ $(p_0, q_0) = (0.507829413, 0.436538150)$. Hence an upper bound on $\max_{\mathcal{R}_{HK}} (2R_1 + R_2)$ is given by the maximum
value of the supporting hyperplane to $F(p, q)$ at $(p_0, q_0)$, which is in turn upper bounded by $F(p_0, q_0) + |a| + |b|$,
where $a = \frac{\partial F}{\partial p}\big|_{(p_0, q_0)}$ and $b = \frac{\partial F}{\partial q}\big|_{(p_0, q_0)}$. Evaluating the values, we obtain the upper bound

$$\max_{\mathcal{R}_{HK}} (2R_1 + R_2) \le 1.107577. \tag{18}$$
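The quantities in (16) to (18) are straightforward to evaluate numerically. The sketch below implements $F(p, q)$ from (13) and (16), evaluates it at $(p_0, q_0)$, and runs a coarse grid search. The grid resolution and all function names are arbitrary illustrative choices, and a grid search only gives a lower estimate of $\max_{p,q} F(p, q)$; it does not replace the supporting-hyperplane argument.

```python
from math import log2


def hb(x):
    """Binary entropy in bits, with hb(0) = hb(1) = 0."""
    if x <= 0.0 or x >= 1.0:
        return 0.0
    return -x * log2(x) - (1 - x) * log2(1 - x)


def f(p, q):
    """Closed form (13)."""
    return (hb(q) - 2 * p * hb((1 + q) / 2) - 2 * (1 - p) * hb(q)
            + hb(q + p * (1 - q) / 2) + p * (1 - q))


def F(p, q):
    """Equation (16): I(X1,X2;Y1) plus the envelope given by Lemma 2."""
    base = hb(q + p * (1 - q) / 2) - p * (1 - q)
    t = 1 - 2 * p  # tangent point of Lemma 2, relevant only when p < 1/2
    if q >= max(0.0, t):
        return base + f(p, q)
    return base + (f(p, t) - f(p, 0.0)) / t * q + f(p, 0.0)


P0, Q0 = 0.507829413, 0.436538150
value_at_p0_q0 = F(P0, Q0)

n = 200  # grid resolution (illustrative)
grid_max = max(F(i / n, j / n) for i in range(n + 1) for j in range(n + 1))
```

Both `value_at_p0_q0` and `grid_max` should stay below the bound in (18), with `value_at_p0_q0` close to the single-letter entry for $\lambda = 2$ in Table 1.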

On the other hand, consider the following point in $\mathcal{R}_{two}$ given by

$$R_1 = \tfrac{1}{2} I(X_{11}, X_{12}; Y_{11}, Y_{12}), \quad R_2 = \tfrac{1}{2} H(X_{21}, X_{22} \mid Q),$$

where $P((X_{11}, X_{12}) = (0, 0)) = p_0$, $P((X_{11}, X_{12}) = (1, 1)) = 1 - p_0$, and $P((X_{21}, X_{22}) = (0, 0)) = 0.36 q_0$,
$P((X_{21}, X_{22}) = (0, 1)) = P((X_{21}, X_{22}) = (1, 0)) = 0.64 q_0$, $P((X_{21}, X_{22}) = (1, 1)) = 1 - 1.64 q_0$.
For this choice of distribution we get $2R_1 + R_2 = 1.1080356$, which is strictly larger than the bound given in
(18). This establishes the sub-optimality of the Han-Kobayashi region for the particular example considered
in the Appendix.
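The two-letter evaluation above can be reproduced in a few lines. As Figure 3 is not reproduced here, the channel matrix `W` in the sketch is an assumption: it is the matrix (rows indexed by $x_1$, columns by $x_2$, entries $P(Y_1 = 0 \mid x_1, x_2)$) consistent with the closed form (13). The function names are hypothetical, and $Q$ is taken to be trivial.

```python
from itertools import product
from math import log2

# ASSUMPTION: channel matrix inferred from the closed form (13).
W = [[1.0, 0.5], [1.0, 0.0]]
P0, Q0 = 0.507829413, 0.436538150


def entropy(probs):
    """Shannon entropy in bits of a probability vector."""
    return -sum(x * log2(x) for x in probs if x > 0)


def two_letter_weighted_rate(lam=2.0):
    """lam * R1 + R2 for the two-letter input distribution of the Appendix."""
    px1 = {(0, 0): P0, (1, 1): 1 - P0}            # repetition on (X11, X12)
    px2 = {(0, 0): 0.36 * Q0, (0, 1): 0.64 * Q0,
           (1, 0): 0.64 * Q0, (1, 1): 1 - 1.64 * Q0}

    def q(y, x1, x2):
        return W[x1][x2] if y == 0 else 1 - W[x1][x2]

    outputs = list(product((0, 1), repeat=2))
    # P(y11, y12 | x11, x12), averaging over the interferer's input pair.
    cond = {
        a: [sum(pb * q(y[0], a[0], b[0]) * q(y[1], a[1], b[1])
                for b, pb in px2.items())
            for y in outputs]
        for a in px1
    }
    marg = [sum(px1[a] * cond[a][k] for a in px1) for k in range(4)]
    r1 = 0.5 * (entropy(marg) - sum(px1[a] * entropy(cond[a]) for a in px1))
    r2 = 0.5 * entropy(px2.values())
    return lam * r1 + r2


two_letter_value = two_letter_weighted_rate(2.0)
```

Under the stated assumption on `W`, `two_letter_value` should agree with the value $1.1080356$ quoted above and exceed the single-letter bound (18).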

As mentioned in Section 2.2, the distribution of $(X_{21}, X_{22})$ that outperforms the one-letter region is not
the product distribution; more surprisingly, one is doing repetition coding on $(X_{11}, X_{12})$.
³ Naturally, we choose a point that is numerically very close to the true maximum.




